When considering potential bias, a proxy indicator is some combination of features that reveal the value of a protected characteristic such as gender or ethnic background. For example, Days taken as holiday from work might be indicative of a person's religion, or subjects taken at school may be related to gender. Often it is assumed that if protected characteristics are removed from a dataset before it is used for training, then the resulting machine learning system will be free of bias. However, proxy indicators mean that decisions could still effectively made based on the protected characteristic.
Used in Chap. 20: pages 321, 322
Also known as proxy measures